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4 <110> APPLICANT: Dunn -Coleman, Nigel 

5 Goedegebuur, Frits 

6 Ward, Michael 

7 Yao, Jian 

9 <120> TITLE OF INVENTION: EGVIII Endoglucanase and Nucleic Acids 
10 Encoding the Same 

12 <130> FILE REFERENCE: GC700 

14 <140> CURRENT APPLICATION NUMBER: US 10/028,245 

15 <141> CURRENT FILING DATE: .JJJTt} 1 - I^^^TS 
17 <160> NUMBER OF SEQ ID NOS / 

19 <170> SOFTWARE: FastSEQ foi^-windows Version 4.0 

21 <210> SEQ ID NO: 1 

22 <211> LENGTH: 1826 

23 <212> TYPE: DNA 

24 <213> ORGANISM: Trichoderma reesei 

26 <400> SEQUENCE: 1 

27 gtcgacccac gcgtccgttc attcttcctc ccctcctcct cctcctcctc ctccccttct 60 

28 cccccatcac cgtcaccact ctcctcattg ccgctctctc tgcgagccat gacgcagcat 120 

29 taacacacac tcgtttctgt tactctcgct gtcgtcggct ctgctcgttg gcattctgct 180 

30 tagcactttg tttcgttctt cgttctcttt taatccgtca tcttctgcaa tctgctgcca 240 

31 tttgttcgac taggtagtgg taatatacgg acagcttttt ttccctcgct caacacgtcg * 300 

32 acgtacaatt aatacaccat ctcgttaatc ggatatatcc ctcggcctct tcctggtgct 3 60 

33 tgtgcgacgc tcctcgtttc tccctctcat tatgcgcgca acctcccttc tggccgccgc 420 

34 cttggccgtg gctggcgatg ccctcgccgg caagatcaaa tatctgggcg tcgccattcc 480 

35 cggaatcgac tttggctgcg acatcgacgg cagctgtccg actgacacgt cgtctgtgcc 540 

36 cctgctgagc tacaaaggag gagatggcgc cggccagatg aagcatttcg ccgaagacga 600 

37 cggcctcaac gtctttcgca tatccgctac atggcagttt gtcctcaaca acacggtgga 660 

38 cggcaagctg gacgagctca actggggctc ctacaacaag gtcgtcaacg cctgtctcga 720 

39 gacgggcgcc tactgcatga ttgacatgca caactttgcc cgctacaacg gcggcatcat 780 

40 cggccaggga ggcgtgtcgg acgacatctt tgtcgacctc tgggtccaga tcgcaaagta 840 

41 ctacgaggac aacgacaaga tcatctttgg cctgatgaac gagccgcacg acctcgacat 900 

42 tgagatctgg gcgcagacgt gccaaaaggt cgtcactgcg atccgaaagg ccggcgccac 960 
4 3 ctcgcagatg atcctcctgc ccggaaccaa ctttgccagc gtcgagacgt atgtgtccac 1020 

44 tggcagcgcg gaagccctcg gcaagattac gaacccggat ggaagcaccg atttgctgta 1080 

45 ctttgatgtc cacaagtatc tcgacatcaa caactccggg tcgcacgccg agtgcaccac 1140 

46 agacaacgtc gacgccttca acgacttcgc ggactggctg aggcagaaca agcgccaggc 1200 

47 catcatctcc gaaacgggcg cgtccatgga accttcgtgc atgactgcct tctgcgccca . 1260 

48 gaacaaggcc attagcgaaa acagcgacgt ctacattggc tttgtgggct ggggtgccgg 1320 

49 cagctttgac acgtcgtaca tcttgactct gactcccctc ggcaagcccg gcaactacac 1380 

50 cgacaacaag ctcatgaacg agtgcattct ggaccagttt accctcgacg aaaagtaccg 1440 

51 tccaacaccc acctcaattt ccacagcggc ggaagagacg gccacggcga cagcaacctc 1500 

52 tgacggcgac gcgccatcca ctacgaagcc catctttagg gaagaaaccg cctctcccac 1560 

53 tcccaatgct gttaccaagc cctcgcccga cacgagcgac tcttccgacg acgacaagga 1620 
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54 ctcggcagca tctatgagtg cccagggctt gacaggcacg gtgctgttta ctgttgctgc 

55 ccttggctac atgctggtag cgttttgatg tttttttttt aatgagtttg tatacctaat 

56 gagcatgatt gagatgctac gtagtatata tgtctttacg ggtacataag actagagcca 

57 tgttgtaatc aaaaaaaaaa aaaaaa 

59 <210> SEQ ID NO: 2 

60 <211> LENGTH: 419 

61 <212> TYPE: PRT 

62 <213> ORGANISM: Trichoderma reesei 

64 <400> SEQUENCE: 2 

65 Gly Lys lie Lys Tyr Leu Gly Val Ala lie Pro Gly lie Asp Phe Gly 

66 1 5 • 10 .15 

67 Cys Asp lie Asp Gly Ser Cys Pro Thr Asp Thr Ser Ser Val Pro Leu 

68 20 25 30 

69 Leu Ser Tyr Lys Gly Gly Asp Gly Ala Gly Gin Met Lys His Phe Ala 

70 35 40 45 

71 Glu Asp Asp Gly Leu Asn Val Phe Arg lie Ser Ala Thr Trp Gin Phe 

72 50 55 60 

73 Val Leu Asn Asn Thr Val Asp Gly Lys Leu Asp Glu Leu Asn Trp Gly 

74 65 ' 70 75 80 

75 Ser Tyr Asn Lys Val Val Asn Ala Cys Leu Glu Thr Gly Ala Tyr Cys 

76 85 90 95 

77 Met lie Asp Met His Asn Phe Ala Arg Tyr Asn Gly Gly lie lie Gly 

78 100 105 110 

79 Gin Gly Gly Val Ser Asp Asp lie Phe Val Asp Leu Trp Val Gin lie 

80 115 120 125 

81 Ala Lys Tyr Tyr Glu Asp Asn Asp Lys lie lie Phe Gly Leu Met Asn 

82 130 135 140 

83 Glu Pro His Asp Leu Asp lie Glu lie Trp Ala Gin Thr Cys Gin Lys 

84 145 150 155 160 

85 Val Val Thr Ala He Arg Lys Ala Gly Ala Thr Ser Gin Met He Leu 

86 165 170 175 

87 Leu Pro Gly Thr Asn Phe Ala Ser Val Glu Thr Tyr Val Ser Thr Gly 

88 180 185 190 

89 Ser Ala Glu Ala Leu Gly Lys He Thr Asn Pro Asp Gly Ser Thr Asp 

90 195 200 205 

91 Leu Leu Tyr Phe Asp Val His Lys Tyr Leu Asp He Asn Asn Ser Gly 

92 210 215 220 

93 Ser His Ala Glu Cys Thr Thr Asp Asn Val Asp Ala Phe Asn Asp Phe 

94 225 230 235 240 

95 Ala Asp Trp Leu Arg Gin Asn Lys Arg Gin Ala He He Ser Glu Thr 

96 245 250 255 

97 Gly Ala Ser Met Glu Pro Ser Cys Met Thr Ala Phe Cys Ala Gin Asn 

98 260 265 270 

99 Lys Ala He Ser Glu Asn Ser Asp Val Tyr He Gly Phe Val Gly Trp 

100 275 280 285 

101 Gly Ala Gly Ser Phe Asp Thr Ser Tyr He Leu Thr Leu Thr Pro Leu 

102 290 295 * 300 

103 Gly Lys Pro Gly Asn Tyr Thr Asp Asn Lys Leu Met Asn Glu Cys He 

104 305 310 315 320 



1680 
1740 
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105 Leu Asp Gin Phe Thr Leu Asp Glu Lys Tyr Arg Pro Thr Pro Thr Ser 

106 325 330 335 

107 lie Ser Thr Ala Ala Glu Glu Thr Ala Thr Ala Thr Ala Thr Ser Asp 

108 340 345 350 

109 Gly Asp Ala Pro Ser Thr Thr Lys Pro lie Phe Arg Glu Glu Thr Ala 

110 355 360 365 

111 Ser Pro Thr Pro Asn Ala Val Thr Lys Pro Ser Pro Asp Thr Ser Asp 

112 370 375 380 

113 Ser Ser Asp Asp Asp Lys Asp Ser Ala Ala Ser Met Ser Ala Gin Gly 

114 385 390 395 400 

115 Leu Thr Gly Thr Val Leu Phe Thr Val Ala Ala Leu Gly Tyr Met Leu 

116 405 410 415 

117 Val Ala Phe 

120 <210> SEQ ID NO: 3 

121 <211> LENGTH: 19 

122 <212> TYPE: PRT 

123 <213> ORGANISM: Trichoderma reesei 

125 <400> SEQUENCE: 3 

126 Met Arg Ala Thr Ser Leu Leu Ala Ala Ala Leu Ala Val Ala Gly Asp 

127 15 10 15 

128 Ala Leu Ala 

131 <210> SEQ ID NO: 4 

132 <211> LENGTH: 1317 

133 <212> TYPE: DNA 

134 <213> ORGANISM: Trichoderma reesei 

136 <400> SEQUENCE: 4 

137 atgcgcgcaa cctcccttct ggccgccgcc ttggccgtgg ctggcgatgc cctcgccggc 60 

138 aagatcaaat atctgggcgt cgccattccc ggaatcgact ttggctgcga catcgacggc 120 

139 agctgtccga ctgacacgtc gtctgtgccc ctgctgagct acaaaggagg agatggcgcc 180 

140 ggccagatga agcatttcgc cgaagacgac ggcctcaacg tctttcgcat atccgctaca 240 

141 tggcagtttg tcctcaacaa cacggtggac ggcaagctgg acgagctcaa ctggggctcc 300 

142 tacaacaagg tcgtcaacgc ctgtctcgag acgggcgcct actgcatgat tgacatgcac 360 

143 aactttgccc gctacaacgg cggcatcatc ggccagggag gcgtgtcgga cgacatcttt 420 

144 gtcgacctct gggtccagat cgcaaagtac tacgaggaca acgacaagat catctttggc 480 

145 ctgatgaacg agccgcacga cctcgacatt gagatctggg cgcagacgtg ccaaaaggtc 540 

146 gtcactgcga tccgaaaggc cggcgccacc tcgcagatga tcctcctgcc cggaaccaac 600 

147 tttgccagcg tcgagacgta tgtgtccact ggcagcgcgg aagccctcgg caagattacg 660 
14 8 aacccggatg gaagcaccga tttgctgtac tttgatgtcc acaagtatct cgacatcaac 720 

149 aactccgggt cgcacgccga gtgcaccaca gacaacgtcg acgccttcaa cgacttcgcg 780 

150 gactggctga ggcagaacaa gcgccaggcc atcatctccg aaacgggcgc gtccatggaa 840 

151 ccttcgtgca tgactgcctt ctgcgcccag aacaaggcca ttagcgaaaa cagcgacgtc 900 

152 tacattggct ttgtgggctg gggtgccggc agctttgaca cgtcgtacat cttgactctg 960 

153 actcccctcg gcaagcccgg caactacacc gacaacaagc tcatgaacga gtgcattctg 1020 

154 gaccagttta ccctcgacga aaagtaccgt ccaacaccca cctcaatttc cacagcggcg 1080 

155 gaagagacgg ccacggcgac agcaacctct gacggcgacg cgccatccac tacgaagccc 1140 

156 atctttaggg aagaaaccgc ctctcccact cccaatgctg ttaccaagcc ctcgcccgac 1200 

157 acgagcgact cttccgacga cgacaaggac tcggcagcat ctatgagtgc ccagggcttg 12 60 

158 acaggcacgg tgctgtttac tgttgctgcc cttggctaca tgctggtagc gttttga 1317 
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